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1 Introduction 

Various natural population genetics models lead to a representation of the ge- 
nealogical tree by a process called Kingman's coalescent fJGl [T7] . Kingman's 
coalescent is a Markov process which can be informally described as follows: in a 
fixed sample of n individuals from the population, each pair of ancestral lineages 
coalesces at rate 1. 

In population genetics, one uses the above process to quantify polymorphism 
in a homogeneously mixing population under neutral evolution. However, there is 
some evidence that for modeling evolution of marine populations (see, e.g., |I9]), 
the use of coalescent processes which allow multiple collisions is more appropriate 
than that of Kingman's coalescent, where only pairs of blocks can merge at any 
given time. Similarly, multiple collisions are natural for modeling evolution of 
viral populations, where natural selection plays a very strong role. They also 
emerge in the fine-scale mapping of disease loci [21] . 

A suitable family of mathematical models has been introduced and studied 
by Pitman [22] and Sagitov [26] under the name A-coalescents or coalescents with 
multiple collisions. We postpone the precise definitions of these processes until 
the next section. 

Let A^^ = N := (Nf, t > 0) be the number of blocks process corresponding 
to a particular A-coalescent process. In view of applications, we concentrate on 
A-coalescents such that P{Nt < oo, t > 0) = 1 and limt^o+ A"* = oo (here A^ is 
really an entrance law). This property is typically referred to as coming down from 
infinity (see Section [2^21 for a formal definition). It is important to understand the 
nature of divergence of A'f as t decreases to 0. In the current paper, our goal is to 
exhibit a function v : (0, oo) — > (0, oo) such that 

Ihn A = 1, almost surely. 

t-0 v{t) 

We call any such v the speed of coming down from infinity (speed of GDI) for 
the corresponding A-coalescent. Note that the limit above is in fact the limit as 
t —>■ 0+, from now on we always write t -^ 0. The exact form of the function 
V is implicit and somewhat technical, see Theorem [1] for the precise statement. 
However, in many situations of interest, one can find a simpler function g{t), of- 
ten a power in t, such that g{t)/v{t) — >• 1, and therefore Nt/g(t) ^ 1, as t — »• 0. 
Then we also refer to g as the speed of GDI for the corresponding coalescent. As 
mentioned above, Kingman's coalescent is the simplest A-coalescent. In partic- 
ular, one can quickly find its speed of GDI by considering the "time-reversed" 
process. Analogous time-reversals for general A-coalescents seem to be difficult to 
grasp. The speed of GDI was recently determined for Beta-coalescents and their 



"perturbations" in Berestycki et al. [5] - [4] and Bertoin and Le Gall [6] (where 
convergence is established in probability). See also the comment following the 
statement of Theorem [1] below. 

With the above biology motivation in mind, there is a strong interest in un- 
derstanding (see, e.g., ^0[ [131 [20]) analogues of Ewens' sampling formula for 
A-coalescents. It seems that only Kingman's coalescent allows for exact solution 
(see, e.g., [H] or [12]). while in the general case one should aim for good approx- 
imations. The only previous detailed analysis of this kind was carried out in [5] 
and ^J for the special case of Beta-coalescents. The above result can be viewed as 
the first step towards analogous understanding of the general A-coalescent case. 

In a parallel work [3] we discuss the consequence of our main results to the 
problem of quantifying polymorphism in a population whose genealogy is driven by 
a coalescent with multiple collisions. In the same paper, we will describe a general 
connection between the small-time asymptotics of A-coalescents and continuous 
random trees and their associated continuous-state branching processes, as well 
as generalized Fleming- Viot processes. These connections enable one to guess 
the form of function v{t), and they imply the convergence in probability of the 
quantity Nt/v{t) of interest under certain technical conditions. They can also be 
useful in determining the power law order of growth of f as t ^ 0. 

To the best of our knowledge, the martingale analysis in the current context 
is novel. We believe that it is of independent interest. Although similar in spirit, 
our setting is different from the general setting of Darling and Norris [8] . For their 
technique to apply, it is necessary to start with good bounds on the accumulated 
absolute difference of the "drifts" of the Markov chain and the solution to the 
corresponding differential "fluid-limit" equation. Here it seems difficult to obtain 
such bounds, however it is possible to work directly (cf. the local martingale M'^ 
from f[22|) ) with the accumulated (non- absolute) difference of the drifts in order 
to obtain sufficiently good asymptotic estimates. 

The rest of the paper is organized as follows: Section [2] contains definitions 
and notations, the main results are stated in Section [3] and proved in Section [U 
with some technical estimates postponed until the Appendix. 

2 Definitions and preliminaries 

2.1 Notation 

We recall some standard notation, and introduce additional notation to simplify 
the exposition. 

Denote the set of real (resp. rational) numbers by M (resp. Q) and set 



]R+ = (0,00). For a,b & M., denote by a A 6 (resp. a \/ b) the minimuin 
(resp. maximum) of the two numbers. 

Let N := {1, 2, . . .} and let V be the set of partitions of N. Furthermore, for 
n G N denote by Vn the set of partitions of [n] := {1, . . . , n}. 

If / is a function, defined in a left-neighborhood (s — e, s) of a point s, we 
denote by f{s—) the left limit of / at s. 

Given two functions f,g : M+ -^ M+, write / = 0{g) if limsup /(x)/5'(x) < 
00, / = o{g) if limsup f{x)/g{x) = 0, and f ^ g iilim. f{x)/g{x) = 1. The 
point at which the limits are taken might vary, depending on the context. 

If X and Y are two random objects we write X = Y to indicate their 
equivalence in distribution. As usual, convergence in distribution will be 
denoted by =^ symbol. 

If ^ = {J^t,t > 0) is a filtration, and T is a stopping time relative to JF, 
denote by J^t the standard filtration generated by T, see for example [TT], 
page 389. 

For u a finite or cr-finite measure, denote the support of u by supp(z/). 

2.2 A-coalescents 

Let A be a finite measure on [0, 1]. The A-coalescent is a Markov process (Ilf , t > 0) 
with values in V (the set of partitions of N) , characterized as follows. If n G N, 
then the restriction (IlJ , t > 0) of (11^,^ > 0) to [n] is a Markov chain, taking 
values in Vn, with a following dynamics: whenever IIJ is a partition consisting 
of b blocks, the rate at which a given fc-tuple of its blocks merges is 

x''-\l-xf-''A{dx). (1) 



'[0,1] 

Note that mergers of several blocks into one block are possible, but multiple 
mergers do not occur simultaneously. For a generalization of A-coalescents where 
multiple mergers are possible see Schweinsberg [29j. For a generalization of A- 
coalescents to spatial (not a mean- field) setting see Limic and Sturm [18]. 

We will quote here several basic properties of the A-coalescent, and refer the 
reader to Pitman [22] for details and additional analysis. When A({0}) = 0, the 



corresponding A-coalescent can be constructed via a Poisson point process in the 
following way. Let 

be a Poisson point process on M+ x (0, 1) with intensity measure dt®v{dx) where 
v{dx) = x~'^A{dx). Each atom {t,x) of vr influences the evolution of the process 
n as follows: for each block of n(t— ) flip a coin with probability of heads equal 
to x; all the blocks corresponding to coins that come up "head" then merge 
immediately into one single block, while all other blocks remain unchanged. Note 
that in order to make this construction rigorous, one first considers the restrictions 
(n'^"^(t),t > 0), since the measure u(dx) = x~'^A{dx) may have infinite total mass. 

We next recall a remarkable property of A-coalescents. Let E be the event 
that for all t > there are infinitely many blocks, and let F be the event that 
for all t > there are only finitely many blocks. Pitman [22] showed that, if 
A({1}) = 0, only the following two types of behavior are possible, depending on 
the measure A: either P{E) = 1 or P{F) = 1. When P{F) = 1, the process 
n is said to come down from infinity. For instance, Kingman's coalescent comes 
down from infinity, while if A{dx) = dx is the uniform measure on (0,1), then 
the corresponding A-coalescent does not come down from infinity. This particular 
A-coalescent was discovered by Bolthausen and Sznitman [^ in connection with 
spin glasses. 

A necessary and sufficient condition for a A-coalescent to come down from 
infinity was given by Schweinsberg [28]: define 



7fe = ^{k - 1) ( ^ ) Ab,fc, 



then the A-coalescent comes down from infinity if and only if X]fel2 7fe~^ ^ '^• 

Recently, Bertoin and Le Gall [6] observed that this condition is equivalent to 
the following requirement: define 

Mq) = HQ)-= [ ie-'''-l + qxMdx), (3) 

^[0,1] 

where v{dx) = x~'^A(dx), then 



OO pQ 

y 7{^^ < OO if and only if / 

6=2 "^" 



ip{q) 



< OO, (4) 



where the right-hand side is finite for some (and then automatically for all) a > 0. 
Somewhat remarkably, the divergence rate function v is given (cf. definition ([8]) 



in the next section) in terms of the right hand side in (jlj). The condition (j4]) is 
well-known in the Levy processes literature as the Grey's criterion for extinction 
of the underlying continuous-state branching process. We refer the reader to [3] 
for further explanation of the above connections. 

3 Main results 

Let A be a finite measure on [0, 1], and let (Hf, t > 0) be a A-coalescent. Without 
loss of generality we may and will henceforth assume that A is a probability 
measure, i.e., 

A[0,1] = L (5) 

Indeed, a scaling of the total mass of A by a constant factor will induce the scaling 
of the speed of evolution (and therefore, that of coming down from infinity) by 
the same factor, and the speed of GDI v from ([H]) below will scale in the same 
way. 

To each such measure A we associate a function ip defined in ([3]). Moreover, 
for a probability measure A of the form A = (1 — c)A + c6q^ where A has no atom 
at 0, we may rewrite as 

M^l) = ^q' + (1 - c) /" (e-^^ - 1 + qxMdx). (6) 

'^ J [0,1] 

Note that if c = 1 we retrieve the Kingman coalescent, whose small time behavior 
is well- understood. Henceforth we assume that c < 1. 

When ijj is such that the integral in (jl]) is finite, or equivalently, when the 
corresponding A-coalescent comes down from infinity, we can define 

C^ da 

u4t) = u{t):= J ^G(0,oo), t>0, (7) 

and its cadlag inverse 

vjt) = v(t):=mi\s>0: -—-dq<t\,t>0. (8) 

I Js ■^{q) J 

Denote by {N^(t),t > 0) = {N^,t > 0) the number of blocks process for the 
A-coalescent {Il{t),t > 0). The first main result of this paper is 

Theorem 1. 

lim —— = 1, almost surely. (9) 



Note that if 11 does not come from infinity, both N^^ = N^(t) = oo, for all t > 0, 
almost surely, and the formal definition ([8]) yields f^ = oo, so ([9]) extends trivially 
if oo/oo = 1. 

We next comment on some special cases of Theorem [H When A = (5o, we have 
v{t) = 2/t, and we recover the well-known result that for Kingman's coalescent, 
the number of blocks is almost surely asymptotic to 2/t. Another interesting case 
occurs when A has the Beta(2 — a, a) distribution for some 1 < a < 2. That is, 

A{dx) = \ x^-"(l - xT-' dx (10) 

1 (2 — a)i {a) 

Here it is not hard to see that il){q) ~ Cig" as g ^ oo, and thus that 

t;(t) ~C2t-^/(°-^), ast-^O, 

where Ci = (r(a)a(a — 1))~^ and C2 = {aT[a))~^^'^"'~^\ In fact these calculations 
can easily be generalized to the case where A is regularly varying near with 
index 1 < a < 2. In this case. Theorem [T] strengthens Lemma 3 in [S]. 

However, we emphasize that the most delicate case of the above theorem occurs 
when the measure A "wildly oscillates" in any neighborhood of 0. An example 
of such a measure is constructed in the appendix of [3]. It illustrates potential 
difficulties in the analysis of functions ■?/', m, or v directly. 

With a bit more work we obtain as the second main result an analogue to 
Theorem [1] in terms of convergence of moments. 



Theorem 2. For any d G [1, oo), 



\m\E sup 
^^0 \ te[o,s] 



N^{t) '^ 



v^{t) 



0. (11) 



The following consequence of Theorem [T] says that, among all the A-coalescents 
such that A[0, 1] = 1, Kingman's coalescent is extremal for the speed of coming 
down from infinity. 

Corollary 3. Assume ^. Then with probability 1, for any e > 0, and for all t 
sufficiently small, 

N\t)>-^{l-s). 

Proof. Without loss of generality assume that the A-coalescent comes down from 
infinity. To see how the corollary follows from Theorem [1], observe that since 
^-qx ^i _ g^ _^ q'^x'^/2 for x > 0, 



If 2 

^(g) < — / x^v{dx) < — (due to ©). (12) 

2 Jfo.il 2 



Hence 

'■°° 2 2 , . 2 



u,l,is) > —dq = - and v^{t) > -. (13) 

J s Q ^ ^ 

Due to Theorem [H A^^(t) ~ t;^(t) as t ^ 0, implying that N^{t) > 2(1 - £)/t 
with high probabihty for all t small. D 

Remark 4. It is interesting to compare the last result with the following fact 

shown in Angel et al. [1], 

1 

N^{t)dt = oo, (14) 

'0 

regardless of the choice of the finite measure A. Corollary [3] may be used to give 
an alternative proof of ( IT4|) . 

The following result is interesting from the perspective of applications in pop- 
ulation genetics. More specifically, the total length of the coalescent tree is rele- 
vant for predicting the number of mutations in a large but finite sample. Let 
jyA.n (jgj^ote the number of blocks process of the restriction n^"-* with initial 
state IIq = {{1}, . . . , {n}}, as defined at the beginning of Section 12.21 Let 
r„ := inf{s > : N^{s) < n}, and let H^ := {N^{Tn) = n} be the event 
that the (unrestricted) A-coalescent ever attains a configuration with exactly 
n blocks. Then, due to the strong Markov property, the conditional law of 
{N^{s + Tn), s > 0) given J^m on the event if„, equals the law of A^^'"'. Let 
tn = u^{n) so that v^{tn) = n. 

Theorem 5. For each s > we have 

J'N^'''{t)dt /;ArA«(t)dt 

ii"^ f;v4tn + t)dt = ii"^ /;E(ArA"(t))c^t = ^' ^^ '^'''''''''y- 

For Kingman and Beta coalescents (that is, when A is of the form A = 6o or flU) 
with 1 < a < 2), the above convergence holds almost surely. 

Let r^" = inf{t > : A^'^'"(t) = 1}, so that j^^ N ^^"^ {t) dt equals the total 
length of the (A- ) coalescent tree with n leaves. Moreover, for any fixed s > 

I ' A^^'"(t) dt^ r N^{t) dt, almost surely, 

(see section 14. 4p where the limit is a finite random variable. Hence the above 
theorem yields the asymptotics for the total length of the coalescent (genealogical) 
tree. Some more detailed analysis is postponed until [3]. 
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Whereas Theorem [T] is a law of large numbers-type result for A^^, Theorem [5] is 
a law of large numbers-type result for f^^ N^'"'(t) dt. A central limit theorem for 
lengths of partial coalescent trees is obtained by Delmas, Dhersin and Siri- Jegousse 
[9] (see also [2]) for the Beta-coalescent case, similar questions for general A- 
coalescents remain open. 

4 Martingale based arguments 

We now proceed towards the proof of Theorem [H The following easy to check 
facts will be used in our analysis. 

Lemma 6. The function ip : [0, oo) -^ M+ of 1^ is (strictly) increasing on [0, oo), 
and convex on (0, oo). Furthermore, for v^ as in ^ we have f^(s) = —ip^v^^s)), 
so that v^ is decreasing with its derivative decreasing in absolute value. 

Due to Lemma [HI postponed until the next section, we can and will suppose 
without loss of generality that supp(A) C [0,1/4]. This assumption simplifies 
some technical estimates. 

In this section we write A^ instead of A^^ whenever not in risk of confusion, 
and we also abbreviate v = v^. We start by observing that the function v is the 
unique solution of the following integral equation 

\og{v{t)) - \og{v{z)) + [ tMi:)! dr = 0, yO<z<t, (15) 

Jz v{r) 

with the "initial condition" f(0+) = oo (see Lemma [9] for properties oi %lj{q)/q). 
It is then natural to consider, for each fixed z > 0, the process 

M{t) := \og{N{t)) - \og{N{z)) + f ^i^^^^dr, t > z. (16) 

Jz N{r) 

Let no > 1 be fixed. Define 

r„„ := inf{s > : N{s) < no}. (17) 

The following proposition tells us that M(t Ar^^) is "almost" (up to a bounded 
drift correction, and integrability condition) a martingale, with respect to the 
natural filtration (JF^, t > 0) generated by the underlying A-coalescent process. Its 
proof uses some general facts about binomial distributions, with precise statements 
and arguments postponed until the Appendix. In particular, in the rest of this 
section the parameter no is taken to be the integer no from Lemma | 



As usual, E[dXs\J^^ denotes the infinitesimal drift of a continuous-time process 
{Xsi s > 0) with respect to the filtration JF at time s. Similarly, we denote by 
E[{dXsY\J^!^ the corresponding infinitesimal second moment. That is, 

^l^^:=limlE[X,^,-X.|^,] 
as e^o e 



and 



as e^o e 



XsY\Tsl 



Proposition 7. There exists some deterministic C < oo such that 



E[d\og{N{s))\J^; 



Nis) 



+ h{s) ds 



where {h{s),s > z) is an J^ -adapted process such i/iat sup^gj^.zArn ] \^{^)\ — C*, and 

E[[d\og{N (s))]'^\J-'s]l{s<T„ } < C ds, almost surely. 

Both estimates are valid uniformly over ^ > 0. 

Proof. To prove the proposition, it suffices to show that for each s > 0, we have 
on {N{s) > no] 

E{d\ogiN{s))\^^^{N{s)) 



ds 



Nis) 



\h{s)\ = 0{ I pMdp)], (19) 

'[0,1/4] 



and 



E{[d\og{Nismj's) = 



p^v{dp) I ds 



[0,1/4] 



(20) 



where O(-) can be taken uniformly in s. Note that the finite integrals above are 
in fact taken over [0, 1], since v{dx) = i^{dx)l{x€[o,i/4:]} by assumption. 

Recall the Poisson point process construction of section 12.21 and fix n > tiq. 
If A({0}) = 0, then on the event {A^(s) = n} an atom of size p arrives at rate 
i'{dp)ds, and given that, logA^(s) = logn jumps to log(i?„^p + l{B„^p<n})) where 
Bn^p has Binomial (n, 1 — p) distribution. Hence we have 



E{dlog{N{s))\J^s 



E 



[0,1] 



log 



Bn,p + l{B„,p<n} 



n 



v{dp) ds. 



In the general case where A({0}) = c E (0, 1), we have on the same event 
E{d\og{N{s))\J's) 



ds 



1-c 



E 



[0,1] 



log 



Bn,p + l{B„.p<n} 

n 



/ 7 \ / ""■ \ 1 n — 1 
v[dp)+c\ \ log——. 
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Let V'ol?) = Q'^/2 be the function ip corresponding to the atomic K{dx) = So^dx). 
Note that 

'n\ n-1 c c ^.^ , , ipo(n) c ^.^ , , 

c| ^ I log — = --n + - + 0{l/n) = -c-^^^^ + - + 0{l/n). 



n 



n 



In view of ([6]), the estimate flTOl) will follow by Lemma [19] in the Appendix, pro- 
vided that 



|np-l + (l-p)"-(e-"P-l + np)| = |(1 - p)" - e-"P| < Cnp^ 



(21) 



for all n > uq, p < 1/4 and for some C < oo. Note that e "^ > (1 — p)" and in 
fact 

e""^ - (1 - p)" = e~"P ("l - exp j-n r^- + ^ + . . . 
Therefore, for p < 1/4 we have 



|-^ (y + y + • • •) } < 1 - exp {-^ {p'+p' + ..■)} 



< 1 — exp I — np 



< -np 



hence both (^ and (^ hold. 

To bound the infinitesimal variance on the event {N{s) = n}, use the second 
estimate in Lemma [191 together with the fact 



E{[d\og{N{smj^s) 
ds 



< (1-c) / E 

[0,1] 



log' 



2 Bn,p + l{B„,p<n} 



n 



2 I \2 



Finally, note that both bounds ([T9l) and (J20l) are uniform in the choice of 2;. D 

4.1 Proof of Theorem [1] 

Recall the process M from ([T6|) and define 



M^(t) = M'(t):=M(tAr„J + 



tAr,, 



h{r)dr, t > z, 



(22) 



so that M^ has martingale increments due to Proposition [71 A general property 
of Doob-Meyer martingale correction (that one can check easily) implies that 

E{[dM'^{t)Y\J^t) < E[[d\og{N{t))]'^\Tt], so that 

E{M'^{s) - M'^{z)f < C{s -z),yO<z< s, (23) 
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where C is the constant from Proposition [71 

Define a family of deterministic functions {vx-, x G M) by 

Vx{t) = v{t + x), t> —X, 

and note that each Vx satisfies an appropriate analogue of (ITSl) on its entire domain, 
namely, Vx{—x+) = oo and 

log{vx{t)) - log{vx{z)) + f t^!^^ dr = 0,y-x<z<t. (24) 



For each fixed z > and each x > —z define 

"* r^(iV(r)) ^K(r)) 



M,,, t := log -^ - log -^ + / 

^^a;(^) ^'x(^) Jz 



N{r) Vx{r) 



+ /i(r) 



dr, t > z, 



where /i is given in fITH]) . 

Moreover, given X G JF^ such that P(X > —z) = 1, we can consider the 
process M^ x- The advantage of this approach will be apparent soon. 

For fixed 2; > 0, the processes M'^,Mz,x and Mz^x are all adapted to the 
filtration (JF^, r > z). 

Remark 8. Strictly speaking, the processes M^, M^^x and Mz^x defined above are 
local martingales (see [23] Chap. II or [25] Chap. VI, 31-34 for definition and first 
properties) since we do not know a priori whether log(A^(t)) has finite expectation. 
However, the optional stopping and Doob moment estimates that we apply below, 
still hold in this more general setting. 

Lemma 9. The function q 1-^ i'{(i)/<l is increasing. 

Proof. Note that q 1-^ '4'{(i)/<l is smooth, and that its derivative at q equals 

ilj\q)q - ij{q) _ /(I - (xq + l)e-«^)z/(da;) _ /(I - {xq + l)e-''^)/x^A{dx) 
q2 q2 q2 

It is a simple matter to check that the integrand in the numerator is positive for 
all X > 0, and that its limit as x — > is g^/2 so again positive. 

The reader is invited to verify in a similar manner that limq^oo('^('?)/Q')" = 
— r ^, e~'^^xA[dx) which implies that q 1— > ip{q)/q is asymptotically concave. Our 
argument does not make use of this fact. D 

The following deterministic lemma is a crucial step in our analysis. It over- 
comes the need for a priori estimates necessary for the method of |8] to apply, as 
discussed in the Introduction. 
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Lemma 10. Suppose f,g : [a,b] 



are deterministic cddldg functions such that 



sup 

x(^[a,b] 



f{^) + / 9{u)du 



<c. 



(25) 



for some c < oo. If in addition f{x)g{x) > 0, x G [a, b] whenever f{x) ^ 0, then 
both 



sup 

xG[a,b] 



g{u) du 



< c and sup |/(a;)| < 2c. 

x£[a,b] 



Proof. Due to the assumptions we know that at any point x where f{x) is positive 
(resp. negative) h{x) := J g{u) du is increasing (resp. decreasing) from the right. 
Define ti := min{x G [a,b] : \h{x)\ > c}, with the convention that ti = 6 if this 
set is empty. Suppose ti < b. By continuity of h, it must be that |/i(ti)| = c. 
Without loss of generahty assume 



h(ti) = c, and hence h{ti + e) > c, 



(26) 



for all small enough e > 0. Having f{t) < for all t G {ti,ti + e) would imply that 
h is decreasing on that same interval, contradicting fl26|) . Therefore there exists 
t G {ti,ti + e) such that f{t) > 0. But since h{t) > c by fl26|) . this would in turn 
contradict fl25|) . Hence it must be ti = b, so that the uniform bound on |/i| holds, 
which together with ( l25l) implies the uniform bound on |/|. D 

Since N{t) ^ oo as t —>■ 0, almost surely, we have 

P{Tno > 0) = 1, or equivalently, lim P(rny < s) = 0. 

s— >0 

Therefore, for any family {Ys, s > 0) of random variables we have lims^o,s<T„ Ys = 
lim^^o ^s, almost surely (in the sense that whenever one of the limits exists so does 
the other). Without loss of generality we will henceforth write M^ .^.(t) instead of 
Mz^xit A r„(j), t G [z, s] instead of t G [z,s A r„Q], etc. 

Fix a* G (0, 1/2). By Doob's L^-inequality for martingales and (!23|) we have 



P( sup \M'M - M',{z)\ >sn< s"'°' sup E[{M'M - M',{z)y 
te[z,s] te[z,s] 

^-2"*C(S-Z)=0(s^~2a*. 



< s- 



(27) 



Denote by 



telz,s] 
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the complement of the above event. Henceforth we assume that s < (l/C)^-"* . 
Note that then J h{r) dr < J C dr < Cs < s" . So we obtain that on A'^ (hence 
with probabihty greater than 1 — 0(s^~^"*)) 



sup 

tG[2,s] 



log N {t)- log N{z)+ j 

J z 



N{r) 



dr 



<2s" . 



We conclude that A' C A,, where 



AJs) ^A, :-- 



sup 

ti,t2&[z,s\ 



logiV(t2)-logiV(ti)+ / 



*2 il){N{r-)) 
N{r) 



dr 



<As' 



(28) 

The advantage of the new definition is that A^-^ C A^^ whenever zi < Z2 < s. 
Moreover, the bound in ( 1271) is uniform in ^ G (0, s), hence the decreasing property 
of probabihty measures implies P{y^ze{<d,s)Az) = 



P I sup 

ti,t2e(o,s] 



logiV(t2) - logiV(ti) + / *' ti^^ dr 



<4s"' 



l-0(s 



l-2a*' 



Let Xz be the random variable defined by 

N{z)=v{X, + z)=vxM- 
Lemma 11. We have lim^^o-^z = 0, almost surely. 



(29) 



Proof. Since A^ is non-increasing and v is (strictly) decreasing, it is easy to see 
that {Xz + z, z > 0) is also a non-decreasing process, almost surely. Therefore 
lim^^o-^z + z > exists, almost surely. Moreover, the above limit equals 
with probability 1, since X^ + z = u{N{z)), and since P{N{0+) = oo) = 1 and 

D 



liniT 



m(x) = 0. 



Due to (l24l) and (!28|) . we have in particular that 



A^ = i sup 

ti,t2e[z,s 



log 7-T - log ■ 



t2 



After plugging in ti = z, we obtain 

N{t) , f'\ij{N{r)) ij{vx.{r)) 



jjjNjr)) i^ivxAr)) 
N{r) vxAr) 



dr 



<4s" 



Az C { sup 

t£[z,s] 



log 



vxM 



N{r) 



vxAr 



dr 



<4/ 
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Lemma [9] implies the hypotheses of Lemma [TO] omega-by-omega (with a = z,b = s 
and the obvious choice of / and g), therefore 



A^{s) = A^ c < sup 



te 2,s 



log 



Nit) 



vxM 



<8s" 



(30) 



By fixing t < s and varying z G (0, t] (note that log 
we obtain 






f^ze{o,s)Az{s) C <; sup 

z,z'e{o,s),te[zvz',s] 



log 



vxM 



Vx, (t) 



< 16/ 



which together with fl30l) implies 



(^ze{o,s)Az{s) c <{ sup 
te{o,s] 



log 



iV(t) 



lim„ vx,^ (t) 



< 24 s' 



where {zn)n>i is any given deterministic sequence of strictly positive numbers 
converging to 0. Due to Lemma [TTl the continuity off implies lim„^oo "^x^^ (^) = 
v(t), Vt G (0, s], almost surely. To summarize, we have just proved 

Proposition 12. //supp(A) C [0,1/4], then 



P I sup 

te(o,sAT„g] 



log 



N{t) 



vit) 



< 24s"* I > P{n,eio,s)A^i^)) = 1-0(5 



l-2Q*^ 



Theorem [T] now follows due to the Borel-Cantelli lemma, after choosing a deter- 
ministic sequence {sm)m>i of strictly positive numbers converging to sufficiently 
fast so that J2m(^mY~'^°'* ^ °°- 

Remark 13. The fixed scale assumption A[0,1](= A [0,1/4]) = 1 has not been 
used in the above argument. 

4.2 Relaxing assumptions on supp(A) 

Given a probability measure A on [0, 1] and a positive 1] < 1 define its restriction 
A, by 

Ari{dx) = A{dx)l[o^r]]{dx) . 

For each r] G (0, 1], denote by ipri the function ip\^ that corresponds to A^ (cf. ([3])), 
and by Vr^ the corresponding rate function from ([8]). 
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Lemma 14. All the Ay-coalescents, where rj G (0, 1] have the same speed of CDI. 
Moreover, for any fixed f] G (0, 1), 

lini^ = l. (31) 

i^O Vr,{t) 

SO it suffices to prove Theorem U\ for one rj G (0, 1) in order to prove it for all 
r/G (0,1]. 

Proof. Fix rj G (0, 1). Assume first that A({0}) = 0. Then it is easy to see that one 
can find a couphng of the two coalescent processes defined by A and by A^ respec- 
tively, such that the corresponding coalescent block counting processes N^ and 
A^"^" coincide for all t G (0, T^) where P{T.ri > 0) = 1. Namely, recall the PPP con- 
struction of Section [22] and set T^ := min{t > : {t,p) is an atom of vr and p > 

v}- 

If A({0}) > 0, let A'{dx) = A(c?x)l(o,i)(x), and note that the PPP-based 
construction of A'-coalescent can be enriched by superimposing pairwise coalescent 
events at rate A({0}), thus yielding a construction of A-coalescent. Again, one 
can couple such constructions of A-coalescent and A^-coalescent so that the two 
processes agree until T^ as discussed above. 

To prove the lemma, it now suffices to show fl3T|) for any fixed rj G (0, 1). Note 
that we trivially have v{t) < v^{t) for all t > 0, since tfjn^q) < il^{q) for all g > 0. 
Moreover, 

^r,(g) = ^(g)-a,g + 6, + 0(e-''''), 

where a,, := /, ^Al/x)K{dx) and 6^ := L ^Al/x'^)K{dx). Therefore for any < 

z<t. 



. v(t) , v(z) 
log^-log^ + 

Vrj{t) V^{Z) 



f(r) Vji{r) 



+ hJr) 



dr = 0. 



where hz{r) is now a deterministic function, bounded by a fixed constant C, 
uniformly over z. The rest of the argument is a deterministic (and easier) analogue 
of the argument given in Section 14. 1[ We leave it to an interested reader. D 

If A({0}) > 0, then the size of the atom at determines the speed of CDI. 
More precisely. 

Corollary 15. // A({0}) = c> then for all r] G (0, 1] 

v^{t)^— t^O. 
ct 
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Proof. Denote by vq the above function 2/{ct) and note that it corresponds to 

2 



2 

A{dx) = cSo{dx) and ipoil) = ^-, in terms of ([8]). Next note that if 77 G (0, 1] 



then 

2 

where /(g) = o(g^) is a non-negative function. In particular, f,,(t) < fo(i^), t > 0. 
Moreover, since for any e, we can find q{e) < 00, such that 

^^(g) < , for all q > q{e). 

we have by the same reasoning Vn{t) > t'o(t)/(l + e) for all sufficiently small t. 
Letting e ^ implies the statement. D 

4.3 Proof of Theorem [2 

Assume that the parameter no is the maximum of the corresponding quantities 
from Lemma [19] and Lemma [201 Assume initially that supp(A) C [0, 1/4] and fix 
z > 0. With the notation of Section 14.11 in mind, let M^^x^ = M be the process 
given by 

(32) 
Then M^ = 0, and due to Proposition [TJ M is a martingale (in the sense that Mf is 
an integrable random variable, t > z). Note that here we use M as abbreviation, 
the above process should not be confounded with M from f[TB]) . 

We next obtain better estimates on the tails of the distribution of Mt, via 
an analogue of Hoeffding's inequality [15] for discrete martingale sums. Since M 
has only downward jumps, a simple case of a general result of Barlow et al. [2] 
Proposition 4.2.1 (see also ^25j) implies that for any c > 0, 

^(^):=(exp|cM,-^!^^^},t>.), 

(c) 

is a super-martingale started from Sz = 1, with respect to the usual filtration 
J-". Note that Dt in [21 [25] corresponds to E[{dMty\J^t] in our notation, and that 
C is the uniform upper bound from Proposition [7[ 

Fix some x G M+. Let c = x/{C{s — z)), and y = exp{cx/2} = exp{cx — 
c^C(s — z)/2)}, and let Ty = inf{t > z : S^ > y}. Since S^^^ only has downward 
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jumps, it must be Srp = y on {Ty < oo}. Since S^'^^ is supermartingale, using 
optional stopping at Tj, A s, we have 

1 = E (5(^)) > E (Sg,,) 

= yP{Ty<s)+E{Si^hT,>s) 

> yP{Ty < s). 

It follows that: 

P I sup Mt > X I < P I sup Sl'^ > e^^-^'^(«-^)/2 

\tG[2,s] J \te[z,s] 

< P{Ty < s) 

1 r x^ 

In order to obtain the "left tails" we use |2] Proposition 4.2.1 in a less trivial 
sense: if c > then 



£!^4Ba.m)|<,.) 



^(-^) := exp <^ -cMt - 
is a super-martingale, where A^M = M{s) — M(s—) = AslogA^(s A t"„,)). Define 



E('=)(t) := exp!cJ2 {^tMf - e"''^ Kq (t - z) 



te z,. 



exp\cJ2 (AtlogiV(s A r„J)2 - e'^/' Ko (t 



te z,. 



where Kq is the constant from Lemma [201 Due to Lemma [20l we have that for each 
c > 0, the process {E^'^\t),t > 2;) is a non-negative super- martingale started from 
E^'^\z) = 1. Indeed, it is easy to verify in the sense of calculations of Proposition 
El that 

E{dE^'\t)\J^t) = E^'\t) ■ E[exp{c{AtM)^} - l\J^t] - e^'/^Ko ■ E^'\t) dt 
< E^'\t) 






dt = 0, 
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almost surely. To include the case A({0}) > in the above calculation note that 
by a standard estimate (!50l) and Taylor's series expansion 



n\ , ., 2/' -N-N. -x c _/c e 



(exp{clog"((n - l)/n)} -l) = - + 0[- + 



n n" 



Without loss of generality one can assume that both Kq > 1 and c/2+0 {c/n + e'^/n^) < 
g9c/4 £qj, n > uq and all c > 0. 
Then, for x > 0, we have 



P{ inf Mt < -X 



< P I Jnf ^ Mi < -X, c^ Y^ (A,M)2 < ex I + P I ^ (AiM)^ > x/c 

te[2,s] 



te z.s 



iG[2,s] 



< P I sup Sf-^) > e-/2--'^(--)/2 I + P I sup E(^')(t) > e 



xc-e9'= /4Xo(s-^) 



, te u.s 



^ g-cz/2+c2C(s-z)/2 I g-xc+e9<= /'^i^oC'^-z) 



We plug in c = |iyiog[x/(-K'o('5 — z))] (here we assume that x > 2Kq{s — z)). 
Since in each exponent the second term is negligible when compared to the first, 
we get the sub-exponential estimate 



P inf Mt< -x] =0 (r(x: s - z)) 

\t€\z,s] ' 



where 



r[x; s) := exp <; -XA/log[x/(Kos)]/4| . 
Now another omega- by-omega application of Lemmas [9] and [10] yields 



1 -0(r(x;s-z)) < P\ sup |Mi| < X 

te[z,s] 



< P I sup 

tG[2,s] 



log 



N{t A r, 



"oy 



<2(x + Cs) 



v{X, + tATno) 

Since lim2_^o'y(-^2 + t) = v{t) as argued before, in the limit we obtain 

N{t A r„ 



P I sup 

<e[o,s] 



log 



v{t A r„ 



<2{x + Cs) > l-0(r(x;s)). 



(33) 
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Note that since A^ is an integer-valued process and f is a decreasing function, 
mite[o,s]'^og{N{t)/v{t)) > mUe[o,sAT„g]'^og{N{t)/v{t)) - log no, almost surely. Now 
( !33l) together with the observation N{t) < N{t A r^,,) implies that the random 
variable 



sup 

te[o,s] 



log 



Nit) 



v(t) 

satisfies P(Ss > x) = 0{r{x] s)), hence 
N{t) 



log I sup 

te[o,s] 



N{t) 



v{t) 



V sup 

te[o,s] 



v{t) 



Nit) 



sup 

te[o,s] 



vit) 



>y\ <o 



^^\og\og(y)-\og{Kos)/i 



as y 



oo. 



In particular, for any (i > 1 we can find a constant D{d) < oo such that 




N{t) 



vit) 



< D{d), 



(34) 



hence (for a possibly different constant) ^'(supjg^Q ,,] \N{t)/v{t) — 1\'^) < D{d). 
Now the almost sure convergence of Theorem [1] combined with an application of 
dominated convergence theorem completes the argument. 

For the case of general supp(A), recall the notation of Section 14^21 In addition, 
denote by A^i/4(t) the number of blocks process corresponding to A1/4. Due to the 
coupling construction used in the argument of Lemma [HI we have 



and moreover 



Ny,{t) > N{t), t > 0, 
Vi/4(t) 



sup 

te[o,s] v{t) 



< 00. 



Therefore (134|) for the Ai/4-coalescent will imply the same estimate (with possibly 
different constant D[d)) for the A-coalescent. 



4.4 Proof of Theorem [5] 

Recall the notation t„ = u^{n) = u{n) introduced before the statement of Theo- 
rem [5l It suffices to show that any subsequence {nk)k>i contains a further subse- 
quence {nk[j))j>i such that 



rAr^."feO)(t)rft 
lim — 

3^00 ^ v{tn^^^^ +t)dt 



1 = lim 



J^ N^'^^u) (t) dt 



3^00 j^E{N^''"'(^){t)) dt' 



almost surely. (35) 
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For t > define 



MJ" := log 






Mr" 



"0 /^(A^A'"(r)) ^iv{tn + r)) 



^A,n( 



f (t„ + r) 



+ /i"(r) rfr 



(36) 

where h'^ is the drift compensator of Iog(A^^'"') with respect to the filtration gen- 
erated by the underlying A-coalescent, and where 

r^^ := inf{s > : iV^'"(s) < Hq}. 

Then M" in (!36l) is a direct analogue of martingale (!32|) . In particular, note that 
by definition of t„, Mq = 0, and as in (1251) 

E{{M]}f) < Ct. 

Recall 7:„g defined in ( TT7|) . and note that with probability 1, r^^^ increases to r„g 
as n — * oo. The arguments leading to Proposition [T2] apply in the current setting 
to yield for a fixed a* < 1/2, and for all n (for n < tiq the result holds trivially). 



P I sup 

te[o,s] 



P I sup 

te[o,s] 



log 
log 



iV^'"(tAC) 



^(^n + ^Ar-) 
iV^'"(tAr„-J 



^(^n + tAr") 



< 24s"' > 1 - 0(si-'"*), and (37) 



<2{x + Cs)]>l-0{r{x;s)). (3^ 



Fix some subsequence (nfc)fc>i.We now show the first convergence statement 
in (!35|) . Choose any sequence of positive numbers Sj decreasing to so that 



E^r'"' 



< oo. 



(39) 



Next choose a further subsequence of {nk)k>i, denoted again by {nj)j>i to simplify 
notation, so that 



r^ J^.v{tn^+t)dt 
lim / v(tn + t) dt = oo, lim -^TT^ — ; — ..... „„. 






(40) 

where the last limit is taken in the almost sure sense. Note that here we use 
observations flT3|) - flT^ and the following straightforward facts: for any fixed < 
a <b < s, J^v{tn^ +t)dt ] J^v{t)dt and J^^ N^'""^ (t) dt T J^N^{t)dt. Due to 



37|) . (!39l) and the Borel-Cantelli lemma, we have 



lim sup 



V{tn, + t) 



0, almost surely. 



(41) 
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The first statement in (135|) now follows by a simple calculus fact: if (/n)n>i, 
{gn)n>i, fni9n '■ [O, s] -^ [0, oo), are two sequences of integrable functions such 
that for some positive sequence 5„, — > it is true that 



lim 

n— >oo 



fn{t)dt 



CXD, 



lim 



and 



then 



lim sup 



!iUt)dt 

fnit) 



9n{t) 
y Iofn{t)dt 

lim -Ys — T^—r 
"^°° Jo 9n{t) at 



lim 



0, 



Is„9n{t)dt 



rS-n 



Jo" 9n{t)dt 



0, 



1. 



For the second convergence statement in ( l35l) . note that (similarly to the ar- 
gument for Theorem [2]) , almost sure convergence (HTl) together with estimate (l38l 
and the dominated convergence theorem, yield 



lim sup 



EN^^""^ (t) 



-1 



0, almost surely. 



V{tn, + t) 

Note that without loss of generality we may assume that 

J^'EN^'''^{t)dt 
lim -^. — -- — = 0. 



(42) 



(43) 



The previous argument applies. 

The final statement of Theorem [5] will follow from Corollary[T6l which is stated 
and proved in next subsection. 



4.4.1 Discussion on almost sure convergence 

It is an open question whether the convergence of Theorem [5] holds almost surely. 
Our technique seems too crude to verify it in general, yet we offer below a partial 
result in this direction. One standard approach would be to use the monotonicity 



PS ps f 

/ Ar^'"(t)rft< / Ar^'"+i(t)rft, and / 
Jo Jo Jo 



V{tn + t)dt< / V{tn+l+t)dt. 

Jo 

It would suffice to find a subsequence Uj along which convergence holds in the 
almost sure sense, and in addition such that 






1. 



(44) 
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lim / 



Corollary 16. Assume that a* < 1/2 is fixed, and that two sequences {sj)j>i and 
{nj)j>i are given, where Uj is non- decreasing. If in addition to (E^j and ^^44\ l> '"'^ 
have 

/;, v{tn, + t) dt 

v{tn. +t)dt = oo, lim -d—- — — - = 0, and (45) 

i^°° Jq v{tn^ + t) dt 

r Ht) dt 

lim -j^ — - < oo, (46) 

^'^°° Js, ^(^ +t)dt 

then the convergence of Theorem holds almost surely. 

Proof. As discussed above, due to (14^ and monotonicity, it suffices to show con- 
vergence as stated in Theorem O along the sequence {nj)j>i. Due to the Borel- 
CanteUi lemma, dSTD^dSH]),© and the fact 

P(limsup{r^J <s,}) = 0, 
i 

we have, as for Theorems [1] and [21 that 

['' AT^'"^ (t) dt ['' EN^'""^ (t) dt 
lim -rf- — ^ — = 1, almost surely, and lim -f^- — ^ = 1. 

i-- j;^ v{tn^ +t)dt J-- /;^ V{tr,^ + t) dt 

Due to ( H5l) . we have 

/' AT^'"^- it) dt J" EN^^^^ (t) dt 
liminf-r:^^ — — > 1, almost surely, and liminf-^^ — — > 1. 

j^oo j^ v(tnj + t) dt j^oo J^ v{tn^ + t) dt 

For the corresponding upper bound on the lim sup, note that due to Theorem [1] 
(resp. Theorem [2]) there exists a positive finite random variable Cq (resp. positive 
constant Co) such that 

J' N^{t)dt f f EN^{t)dt \ 

\v{t)dt - ' + ^°' ""' 1,^'^P- >%(t)^t ^ 1 + ^° j ' f°^ ^" ^ ^ '■ 

Due to fH5l) - fH6l) and monotonicity A^^'"-'(t) < N^{t) (with probability 1), we now 
have both 

Jl A^^'"^ (t) dt Jl EN^'""^ (t) dt 
lim -^ — ; — = almost surely, and lim -d. — ; — = 0, 

J ^'V(tn^+t)dt J^' V{tn^ + t) dt 

which completes the argument. D 

Taking for example a* = 1/4, Sj = 1/j^, and Uj = exp(log^ j) (resp. rij = j^) 
in the case of Kingman (resp. Beta) coalescent, one can verify (left to the reader) 
the hypotheses of the last corollary, implying the final statement of Theorem [51 
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5 Appendix: Binomial calculations 

Lemma 17. If X has Binomial(n,p) distribution and ifY = X — l{x>o}; then: 

(i) EF = np-l + (l-p)", 

(ii) var(y) = np{l - p) + (1 - p)"(l - (1 - pT) - 2np{l - p)"", 
(Hi) EY^ = -np - np^ + nV + 1 - (1 - pT- (47) 

Proof. Property (i) is trivial, (ii) follows easily from the fact that 

cov(X, l{x>o}) = np{l - p)", 

and (iii) is implied by (i) and (ii). D 

Corollary 18. If X has Binomial(n,l-p) distribution and if Y = X + l^x<n}, 
then 

2 



E 



n-Y 



n 



0{p' 



Proof. Note that n — Y has the distribution of the variable Y from Lemma [171 
Hence its second moment is given in (HTll . Since for p < 1/n we have 



2^2 \ 



(1 - p)" = l-np + 0{ny 

the claim of the corollary is true in this case. Now if p > 1/n then np = 0{n^p^) 
therefore the largest term in (l47j) is again of order n^p^. D 

Lemma 19. There exists wq € N and Co < oo such that for all n > uq and all 
p < 1/4, if X has Binomial(n,l-p) distribution, then 



E[log(X + l|x<„})-logn] 



np — 1 + {1 — pY 



n 



<Cop' 



and 



E[{\og{X + l|x<n}) - lognf] < Cop', 
Proof. Let Y = n — X as before, and abbreviate 



T = Tn:= log(X + l{x<n}) - logn = log ( 1 - 

We split the computation according to the event 

An = {Y< n/2}, 



Y -1 



{Y>0} 



n 



(4J 
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whose complement due to a large deviation bound has probability bounded by 

1 



,1, 1 1, 
exp < —n - log - — h - lo^ 



2 ^2p 2 ^2{l-p) 



2>"/2(^_p)n/2 



(49) 



uniformly in p < 1/4 and n. On A^ we have |T| < logra, and on A^ we apply a 
calculus fact, 



logfl — x) + x\ < 



X 



2(1 -xl 



<x2, xe [0,1/2], 



(50) 



to obtain 

E[T] + E 



Y-1 



{y>o} 



n 



U„ 



<(iogn)p(A;;) + E 



(r-i 



{y>o}J 



n^ 



la„ 



Furthermore, since (F — l{y>o})/'^ < 1, we conclude 



E[T] + E 



F-1 



{y>o} 



n 



< {\ogn + l)P{A'^) + E 



{Y - l|K>o})^ 



n" 



(51) 



Note that by Corollary [T8l and Lemma fTTl (i), in order to prove the first estimate 
of the lemma, it remains to show 



(logn)P(A^) < (logn)2V^'(l -P)"^' < Cp" 



(52) 



for some C < oo, all p G [0, 1/4], and all n large. Now consider f : p \-^ {p{l — 
p))"/^/p^. Its derivative at p equals g{p){n(l — 2p)/2 — 2(1 — p)) where g{p) is a 
positive function. It is easy to check that if p < 1/4, then n(l— 2p)/2 — 2p^(l— p) > 
for all 77, > 6. Therefore / is an increasing function of p, so in order to verify 
( 1521) for all p < 1/4, it suffices to check it for p = 1/4. This corresponds to having 
(logn) 2"(3/16)"/2 < c/16, that will hold for all large n = n{C) given a C > 0. 

For the second estimate, again use the partitioning according to An and fl50l) 
to obtain 



ET'<E[T'1aJ+ log' nP{A':,)<[-] E 



{Y - l|K>o})^ 



n^ 



+ log2 nP(A^), 



which differs from ( 15T|) only by an extra factor of order logn multiplying P{A'^), 
so the previous argument carries over. D 

Lemma 20. There exists no G N and Kq < oo such that for all n > uq, p < 1/4 
and c> 0, if X has Binomial(n,l-p) distribution, then 

^[exp{c[log(X + l{x<n}) - logn]2} - 1] < e^'^/^i^op', 
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Proof. The strategy is the same as used for the second estimate in the previous 
lemma, some details are left to the reader. 
Recall that Y = n ~ X and observe that 



^clognp/ AC 



cT^ 



E[e'^' -1] < n^^°s"P(A^)+E[(e'=^ -1)1a„ 






exp < c 



exp < c 



9( r-l|y>o})- 

9(y - i{Y>o}) 

A-n? 



11. 



-1 



Hence it suffices to show that for some Kq, all c > and all n, p as specified 
above, we have 



E 



exp < c 



{y - My>^}) 



n^ 



< e'Kop'. 



(53) 



Without loss of generality, one can assume that c > 1 . 
The left-hand-side above 

n-k f^cik-lf/n"^ _ i 



fc=l ^ ^ 



can be bounded, using the Taylor's expansion, by 



fc=i 



E CVd-rt 



„-* J i±z2l ^ e' {k-lY 



n" 



n^ 



c . ,,2 „.. e- 



n^ 



:(E(r - 1)^ - P{Y = 0)) + —AE{Y - 1)^ - P(F = 0)). 



2n4 



(54) 



Next compute 



E{Y - If - P{Y = 0) = Var(r - 1) + {E{Y - 1))^ - P{Y = 0) 

= np{l -p) + {np - if - (1 - p)" (recall ^) 

< np{l -p) + {np - if - e""^ + 2np'^/3 

<(np)2 + 0(V), (55) 

where for the last inequality we recall that e~^ — 1 + x > for x > 0. Similarly, 
using the fact 

{y - If = y{y -Ify- 2){y - 3) + 2y{y -l){y-2) + {y- if, 
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as well as the expressions for Binomial factorial moments, we have 

E{Y-1)'^-P{Y = 0) (56) 

= n{n - l){n - 2){n - 3^ + 2n{n - l)(n - 2)p^ + E(Y - if - P{Y = 0) 
< n^ + 2n^p^ + (npY + 0{np'^) 
<4nY + 0{np'^). (57) 

Now (l5^ - fl57j) yield (!53|) . and therefore the statement of the lemma, with appro- 
priately chosen no- □ 
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